Taiwan Hakka Languages and TWHK_ToBI Annotation Conventions

نویسندگان

  • Shao-Ren Lyu
  • Ho-hsien Pan
چکیده

This paper proposes a preliminary prosodic annotation system for Taiwan Hakka, “Taiwan Hakka Tones and Break Indices” is called TWHK_ToBI. TWHK_ToBI includes five tiers: ortho, words, tones, breaks, and miscellaneous. The ortho tier contains Romanization of each syllable and dictionary-defined tones; the words tier includes alphabetized SAMPA spellings of each word; the tones tier includes the sandhi tones for each syllable; the breaks tier indicates degree of juncture including words, fused words, intermediate phrase and intonational phrase boundaries; and the miscellaneous tier labels events such as code switching, laugh, and cough.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Toward Constructing A Multilingual Speech Corpus for Taiwanese (Min-nan), Hakka, and Mandarin Chinese

The Formosa speech database (ForSDat) is a multilingual speech corpus collected at Chang Gung University and sponsored by the National Science Council of Taiwan. It is expected that a multilingual speech corpus will be collected, covering the three most frequently used languages in Taiwan: Taiwanese (Min-nan), Hakka, and Mandarin. This 3-year project has the goal of collecting a phonetically ab...

متن کامل

Multilingual Speech Corpora for TTS System Development

In this paper, four speech corpora collected in the Speech Lab of NCTU in recent years are discussed. They include a Mandarin treebank speech corpus, a Min-Nan speech corpus, a Hakka speech corpus, and a Chinese-English mixed speech corpus. Currently, they are used separately to develop a corpus-based Mandarin TTS system, a Min-Nan TTS system, a Hakka TTS system, and a Chinese-English bilingual...

متن کامل

Construct a multi-lingual speech corpus in taiwan with extracting phonetically balanced articles

In this paper, we describe an initial stage to construct a multilingual speech corpus in Taiwan with selecting phonetically balanced scripts. It is expected to collect a multilingual speech corpus covering three most frequently used languages in Taiwan, including Taiwanese (Min-nan), Hakka, and Mandarin Chinese. To achieve the objective, constructing a multilingual phonetic alphabet, namely For...

متن کامل

The Effects of Lexical Tones and Nasal Coda /-n/ to Sadness in Taiwan Hakka

This paper concerns the relation between the emotion of sadness & lexical tone types, and the relation between the emotion of sadness & nasal coda /-n/ for non-Hakka speakers with Hakka stimuli. We try to probe what factors cause non-Hakka speakers to receive the expression of sadness in Hakka language successfully. The results showed that in both level tones and checked tone, the average f0 va...

متن کامل

Scrub Typhus and Comparisons of Four Main Ethnic Communities in Taiwan in 2004 versus 2008 Using Geographically Weighted Regression

PURPOSE On the main island of Taiwan, a higher risk of scrub typhus infection has been reported in endemic clusters in Southeastern Taiwan and in mountainous township areas. However, research on health care problems associated with scrub typhus in Taiwanese ethnic peoples is limited. This study employs spatial analysis of areal data to determine spatial features related to scrub typhus and the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011